Incorporating Metadata into Dynamic Topic Analysis

نویسندگان

  • Tianxi Li
  • Branislav Kveton
  • Yu Wu
  • Ashwin Kashyap
چکیده

Everyday millions of blogs and micro-blogs are posted on the Internet These posts usually come with useful metadata, such as tags, authors, locations, etc. Much of these data are highly specific or personalized. Tracking the evolution of these data helps us to discover trending topics and users’ interests, which are key factors in recommendation and advertisement placement systems. In this paper, we use topic models to analyze topic evolution in social media corpora with the help of metadata. Specifically, we propose a flexible dynamic topic model which can easily incorporate various type of metadata. Since our model adds negligible computation cost on the top of Latent Dirichlet Allocation, it can be implemented very efficiently. We test our model on both Twitter data and NIPS paper collection. The results show that our approach provides better performance in terms of held-out likelihood, yet still retains good interpretability.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using POMDPs to Forecast Kindergarten Students' Reading Comprehension

Using POMDPs to Forecast Kindergarten Students’ Reading Comprehension . . . . . . . . . . . . . . . . . . . . 1 Russell Almond, Umit Tokac and Stephanie Al Ortaiba High-Level Information Fusion with Bayesian Semantics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8 Paulo Costa, Kathryn Laskey, Kuochu Chang, Wei Sun, Cheol Park and Shou Matsumoto Goal-Based Person T...

متن کامل

Time-Varying Dynamic Topic Model: A Better Tool for Mining Microblogs at a Global Level

Inthispapertheauthorsbuildonpriorliteraturetodevelopanadaptiveandtime-varyingmetadataenableddynamictopicmodel(mDTM)andapplyittoalargeWeibodatasetusinganonlineGibbs samplerforparameterestimation.Theirapproachsimultaneouslycapturesthemaximumnumberof inherentdynamicfeaturesofmicroblogstherebysettingitapartfromotheronlinedocumentmining metho...

متن کامل

A damage model incorporating dynamic plastic yield surface

In this paper, a general elastoplastic-damage constitutive model considering the effect of strain rate has been developed. The derivation of this model has been cast into the irreversible thermodynamics with internal variables within the fundamentals of Continuum Damage Mechanics (CDM). The rate effect has been involved as an additional term into the plastic yield surface (dynamic plastic yield...

متن کامل

Unsupervised Feature-Rich Clustering

Unsupervised clustering of documents is challenging because documents can conceivably be divided across multiple dimensions. Motivated by prior work incorporating expressive features into unsupervised generative models, this paper presents an unsupervised model for categorizing textual data which is capable of utilizing arbitrary features over a large context. Utilizing locally normalized log-l...

متن کامل

Probabilistic Models of Topics and Social Events

Structured probabilistic inference has shown to be useful in modeling complex latent structures of data. One successful way in which this technique has been applied is in the discovery of latent topical structures of text data, which is usually referred to as topic modeling. With the recent popularity of mobile devices and social networking, we can now easily acquire text data attached to meta ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012